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ABSTRACT: Severe acute respiratory syndrome coronavirus (SARS-CoV) is a newly emergent virus 
responsible for a worldwide epidemic in 2003. The coronavirus spike proteins belong to class I fusion 
proteins, and are characterized by the existence of two heptad repeat (HR) regions, HR and HR2. The 
HRI region in coronaviruses is predicted to be considerably longer than that in other type I virus fusion 
proteins. Therefore the exact binding sequence to HR2 from the HR1 is not clear. In this study, we defined 
the region of HRI that binds to HR2 by a series of biochemical and biophysical measures. Subsequently 
the defined HR1 (902-952) and HR2 (1145~1184) chains, which are different from previously defined 
binding regions, were linked together by a flexible linker to form a single-chain construct, 2-Helix. This 
protein was expressed in Escherichia coli and forms a typical six-helix coiled coil bundle. Highly conserved 
HR regions hetween mouse hepatitis virus (MHV) and SARS-CoV spike proteins suggest a similar three- 
dimensional structure for the two fusion cores. Here, we constructed a homology model for SARS 
coronavirus fusion core based on our biochemical analysis and determined the MHV fusion core structure, 
We also propose an important target site for fusion inhibitor design and several strategies, which have 
been successfully used in fusion inhibitor design for human immunodeficiency virus (HIV), for the treatment 


of SARS infection. 


‘A new coronavirus has been identified as the distinct 
pathological entity responsible for severe acute respiratory 
syndrome (SARS), which infected more than 8000 people 
and killed 774 worldwide in 2003. The SARS-associated 
coronavirus (SARS-CoV) is not closely related to any of the 
previously characterized coronaviruses isolated from either 
‘humans or animals and has therefore been assigned to a new 
‘group, group IV, along with the previous three known groups 
(,2) 

"The family Coronaviridae is a group of enveloped positive- 
stranded RNA viruses with the largest genome among the 
RNA viruses, and the members are characterized by 34 
envelope proteins embedded on the surface of the viral 
envelope lipid membrane (3, 4). The genome sequencing 
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reveals that, as with other enveloped RNA viruses including 
the coronaviruses, SARS-CoV envelope spike (S) protein 
contains highly conserved heptad repeat regions (HRI and 
HR2), which have been shown to be important in virus 
‘membrane fusion, Such heptad repeat regions have been 
successfully used as targets for virus entry/fusion inhibitors 
in a number of viruses (5, 6). The existence of HR regions 
is also a characteristic of class | viral fusion proteins, which 
is different from class II (two classes, class I and Il, have 
been classified at present) (7), It is generally believed that 
the envelope protein undergoes a series of conformational 
‘changes during the virus fusion process, ‘The HR and HR2 
regions are believed to be important modules/domains in this 
process and show different conformations in different fusion 
states (6). Under the current model, there are at least three 
conformational states of the envelope fusion protein, ‘They 
are the prefusion native state, prehairpin intermediate state, 
and postfusion hairpin state (6, 8). During these state 
twansitions, HRI and HR2 are exposed in an intermediate 
conformational state but bind to each other to form a coiled 
coil structure in the postfusion state, Therefore the in vitro 
introduced HR peptides compete with their endogenous HR 
counterparts in the intermediate state, preventing the transi- 
tion into the formation of HRI/HR2 coiled coil bundle in 
the postfusion state. Some HR peptides or analogues have 
been shown to be potent inhibitors for virus fusion (6, % 
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Picure 1: Schema 


diagram of SARS-CoV spike protein with the location of structurally si 


sant domains indicated. $1 and $2 are 


formed alter proteolytic cleavage (vertial arow) and are noncovalenlly linked. The eaveloped protein fis an N-terminal signal sequence 
(SS) and a transmembrane domain (TM) adjacent to the C terminus. 82 contains two HR (hepta repeat) regions (hatched bars), HR and 
HR2 as indicated. The residues of HRI (8981005) and HR? (1145~1184) used in this study were derived from the LearmCoil-VMF 
prediction program (20). NT1—NT6 are six truncations derived form HRI to test forthe exact region that binds HR2 in HR. The 2-Helix 
protein construct consists of HR2 and part of HRI, which is the major region binding HR2, connected by an eight amino acid linker as 


Indicated 


10). This cviled coil bundle conformation is believed to be 
Important for bringing two lipid membranes (cellular and 
Viral) into proximity with each other, thus enabling the 
membrane fusion for virus entry. Membrane fusion is the 
key step for enveloped virus infection, The HR /HR2 coiled 
coil bundle is called the virus fusion protein fusion core (6). 
Several crystal structures of fusion cores have been deter- 
mined to date, including human immunodeficiency virus 
(HIV) gp41, human respiratory syncytial virus (HRSV) F, 
influenza vis HA, and Ebola virus GP (8, 11~15). Typical 
of this kind of structure, three copies of HRI bind together 
to form a central trimeric core and three copies of HR2, 
‘which is shorter than HRI, surround this core. Because both 
HRI and HR? are o-helical inthis fusion core, this structure 
is also referred to as a six-helix coiled coil bundle, 

To investigate the structural basis of SARS-CoV fusion 
‘and entry and identify new fusion inhibitors, we have carried 
‘out the analysis of the SARS-CoV HRI and HR2 fusion core. 
In this study, both HR1 and HR2 have een expressed as a 
GST-fusion proteins, and their binding regions have been 
analyzed. A single-chain polypeptide of HRI and HR2 
(linked by a flexible amino acid linker), named 2-Helix, has 
also been engineered and analyzed, The engineered 2-Helix 
shows that SARS-CoV § protein is a typical class 1 viral 
fusion protein, as observed recently for the murine coro- 
navirus, mouse hepatitis virus (MHV) (16), 

‘Although sequence similarity is not very high in other 
regions of spike proteins, HR regions between MHV and 
SARS-CoV spike proteins are highly conserved. Based on 
‘our biochemical analysis and determined MHV fusion core 
structure, a SARS coronavirus fusion core model was 
‘constructed. Strategies that have been successfully used in 
fusion inhibitor design for HIV could also be used in the 
design of SARS fusion inhibitors (6, 9 10, 17, 18). 
Therefore, using our fusion core model. we have proposed 
important target sites for the fusion inhibitor design and 


several potential inhibitors that may be useful for treating 
SARS. 

‘Tripet and co-workers have defined the binding regions 
of HRI and HR2 as residues 916-950 of HRI and residues, 
1151~1185 of HR2 (19). In contrast, our results show that 
the binding regions of HRI and HR2 are residues 902-952 
in HRI and 1145—1184 in HR2. There ate several significant 
differences between the two definitions of the fusion core. 
We will give a reasonable interpretation in the further 
discussion. 


EXPERIMENTAL PROCEDURES 


Gene Construction. The SARS-CoV § gene was cloned 
from SARS-CoV coronavirus strain BJ302 clone 1 (GenBank 
accession no, AY429072). The HRI and HR? regions were 
predicted using the computer program LearnCoil-VMF (20) 
‘The HRI region covers amino acids 898~ 1005, whereas the 
HR2 covers amino acids 1145—1184 (Figure 1). To test the 
actual HRI region bound to HR2, six additional HRI 
constructs, named as NTI—NT6, were made (Figure 1). Their 
lengths are shown in Figure 1. All PCR-amplified fragments 
\were cloned into pGEX-6P-1 GST-fusion expression vector 
(Pharmacia) by restriction enzyme sites EcoRI and Xhol 
(introduced by PCR primers). For the 2-Helix construct. the 
HRI region used was derived from amino acids 900 to 957 
and HR2 from 1145 to 1184. The 2-Helix construct was 
made by linking the HR1 and HR2 with an eight amino acid 
linker (GGSGGSGG, single amino acid abbreviation used 
here). All the cloned constructs were verified by direct DNA 
sequencing, 

Protein Expression and Purification. AlL the relevant 
positive expression vectors were transformed into Escheri- 
chia coli strain BL21 (DE3) competent cells and single 
colony was inoculated into 2x YT medium containing 100 
ug/ml. ampicilin at 37 °C for overnight culture. Then the 
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Picture 2: GST-pulldown experiments: (a) purified GST-fusion protein (NTI-NTS, lanes 16) for contol; (b) lanes 
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6, pulldown 


results of GST-fusion protein NT1—NT6 with HR2, respectively: lane 7, HR? for contol. The samples were subjected to Tris-tricine 10% 
SDS-PAGE analysis. Lane M contains molecular weight standards in kD, 


overnight cultue was transfered to new 2 YT medium 
for large-scale protein production at 37 °C. When the culture 
density (ODax) reached 018, the culture was induced with 
(0.2 mM isopropyl--o-thiogalactopyranoside (IPTG) and 
grown for an additional 10 hat 16 °C before the cells were 
harvested, 

The harvested culture was centrifuged, andthe bacterial 
cell pellet was resuspended in PBS (140 mM NaCl, 27 mM 
KCI, 10 mM NasHPO,, 18 mM KH,PO,) and homogenized 
by sonication. The lysate was centrifuged at 18 000g for 20 
min at 4 °C, and the supematant was loaded onto a 
_luathione Sepharose 48 columm (Pharmacia). The protein- 
loaded column was then washed with 10 > column Volume 
of PBS. After that, for GST-fusion protein, the protein was 
luted with 10 mM reduced glutithione. For the GST- 
removed proteins, the GST-3C rhinovirus protease (kindly 
provided by Drs. J. Heath and K. Hudson) was added into 
the resin, and the misture was incubated with gente agitation 
for about 10 hat 4 °C. The target protein was eluted with 
10 mi of PBS. 

For SARS 2-Helix protein, expressed witha His-tag, the 
supernatants were loaded on a Ni~-NTA column (QUAGEN) 
that was equilibrated with PBS. The contaminated protein 
vas washed with washing buffer (1x PBS, 60 mM imida- 
ole) over 10 column volumes, and the target protein was 
luted with elution utr (1 PBS, 500 mM imidazole) for 
5 column volumes, The resultant 2-Helix protein was 
Concentrated toa proper concentration by ulttaitation (10 
Da cutoff, Proteins were analyzed by 10% Tricine SDS: 
PAGE 

Gel-Fitrarion Experimen. The targeted protein was loaded 
ona Superdex 75 column (Pharmacia) running on an Akta 
Explorer FPLC system (Amersham Pharmacia), The factions 
of the peak were collected and run on 10% Tricine SDS— 
PAGE. The molecular weights of the peak fractions were 
estimated by comparison with the protein standard running 
fn the same column 

GST-Pulldown Experiments, Purified GST-fusion protein 
(NTI=NT6) was mixed with GST-removed HR? in PBS at 
4.°C for 2h, and then the mixture was loaded onto the 
fluathione-Sepharose 4B column. The column was thor- 
fughly washed with 10 x column volume of PAS until no 
obvious proteins were seen and then eluted by 10 mM 
feduced glutathione. The collected peak was subjected to 
Tris.tricine SDS~PAGE analysis. 

Circular Dichroism (CD) Spectroscopy. CD spectra were 
measured on a Jasco J-715 spectrophotometer in PBS 
Wavelength specira were recorded at 25, 50,70, and 85 °C 


using a 0.1 cm path length cuvette, For the thermodynamic 
stability of the 2-Helix protein, it was measured at 222 nm 
by monitoring the CD signal in the temperature range from 
25 to 90 °C with a § °C increase for each scan. 

Chemical Cross-Linking. The gel-filtration purified 2-Helix 
proteins were dialyzed against cross-linking buffer (30 mM. 
Hepes, pH 8.3; 100 mM NaCl) and concentrated to about 2 
‘mg/mL by ultafilvation (10 kDa cutoff) Proteins were cross- 
linked with ethylene glycol bis(succinimidy! succinate) 
(EGS) (Sigma). The reactions were incubated for 1h on ice 
at concentrations of 0.1, 0.5, 1.5, and 5 mM EGS and then 
‘quenched with 50 mM glycine, Cross-linked products were 
analyzed under reducing conditions on 10% Tricine SDS~ 
PAGE, 

‘Model Building. The SARS fusion core structure mode! 
‘was built based on our recent crystal structure of MHY fusion 
core (31). All the amino acids of SARS core were substituted 
by those of MHV core. After several cycles of manual 
building and energy minimization, the model was built and 
the surface map was drawn by the program GRASP, 


RESUI 


'S AND DISCUSSION 


All Expressed Polypeptides Are Soluble Proteins. As 
described in the Experimental Procedures, nine expression 
‘constructs of GST-fusion polypeptides were prepared in this 
‘experiment (Figure 1), They were full-length HR1, NTI~6 
of HRI, HR2, and 2-Helix. All engineered constructs were 
verified by DNA sequencing, and the protein expression yield 
was high, AIL nine polypeptides were expressed in the 
supernatants of the sonicated cells, that is, they were soluble 
in their GST-fusion forms and easily purified by GST-affinity 
column (Figure 2a). In addition, all polypeptides were soluble 
‘even after the removal of the fusion partner, GST, with some 
of them showing aggregation at higher concentrations, 

N-Terminal Motety (Amino Acids 900-957) of the Pre- 
dicted HRI Binds HR2, From the LeamCoil-VMF program 
prediction, the HRI region covers over 100 amino acids, and 
itis unlikely that the whole region would interact with HR2, 
though we previously showed thatthe in-vitro-prepared full- 
length HRI and HR2 bound each other (21). Therefore the 
binding region(s) of HRI to HR2 was examined. To do this, 
six additional truncated HRI polypeptides (NT1~6) were 
‘expressed and purified in addition to the full-length HRI. 
GST pull-down experiments were carried out, and it was 
clear that NTI~5 bound to HR2 but NT6 did not (Figure 
2b», 

To refine the binding region and test the HRI-HR2 
binding affinity, CD spectra at different temperatures were 
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Ficune 3: CD spectra for refining the binding region of HRI—HR2. Panel a shows CD spectra of NTL-NTS and HR2 alone. Wavelength 
spectra were recorded al 25, 50, and 70 2. Panel b shows CD spectra of mixtures of NTI—NTS with HR2 in equal molar amounts 


Wavelength spectra were recorded at 25, $0, 70, and 85 °C. 


measured. As shown in Figure 3a, NTI~S all possessed 
‘chelical structure at 25 °C (typical double minima at 208 
‘and 222 nm) but with different thermodynamic stabilities, 
as shown by the altered absorptions following increase of 
the temperature from 25 to 50 and 70 °C, HR2 itself also 
possesses ochelical structure at 25 °C but becomes unstrue- 
tured when the temperature reaches $0 °C, indicating that 
the ochelix structure is not stable. Figure 3b shows CD 
spectra of five mixtures of NT1—5 with HR2 (in equal molar) 
at different temperatures. These complexes all possessed 
ochelical structure at 25 °C, Different thermodynamic 
stabilities were observed with altered absorptions following 
the increase of the temperature from 25 to 50, 70, and 85 
*C. The complexes of NT2—HR2 and NT3—HR2 were more 
stable than the other three complexes. We can conclude that 
the N-terminal moiety (amino acids 900-957) of the 
predicted HRI is the major region tht binds HR2. 

2-Helix Protein Forms Characteristic Stable Siv-Helix 
Bundle. Based on previous idenfication of the region of HRL 
that binds HR2, the HRI and HR2 regions of SARS-CoV 
spike protein have been chosen in this study as described in 
Experimental Procedures, A schematic representation of the 
SARS 2-Helix construct of HRI, linker, and HR2 is shown 
in Figure 1. The single-chain construct of the fusion core 
HRVHR2 was made as 2-Helix. As expected from our 
previous work on paramyxovirus fusion core 2-Helix proteins 
(22-24), the SARS 2-Helix polypeptide construct in this 
study was expressed at high level in the E. coli expression 
system, The protein has been shown to be soluble in either 
PBS or pH 8.0 Tris buffers and at concentrations of up to 
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Ficurr 4: SDS~PAGE analysis of the expression and purification 
of the 2-Helix proteins of SARS-CoV S protein: lane 1, nonin- 
‘duced; lane 2. IPTG-induced (final concentration is mht): lane 
3, puniied 2-Helix protein; lane M, molecular weight standards in 
ki, 


40 mg/ml. This implies thatthe protein bas most likely been 
folded correctly 

‘The SARS 2-Helix proteins were expressed in E, coll as 
soluble proteins (Figure 4). The supernatants of the cell 
lysates were loaded on a Ni-NTA column, and the target 
proteins were purified by affinity chromatography. The 
purified proteins were further analyzed by gel filtration and 
chemical cross-linking for estimation of the molecular 
weight. The SARS 2-Helix protein was eluted between the 
corresponding position of 52 and 14 kDa (Figure 5a), In 
‘comparison, the computed molecular weight of 2-Helix (with 
6 histidine tag) is bout 13 kDa, which indicates that the 
2-Helix might form oligomers (about 40 kDa). The complex 
of full-length HRI with HR2 was eluted just adjacent to 32, 
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a an ee ae’) 
ictus $: Gelfilration analysis of the purified SARS 2-Helix 
protein and complex of HR! and HR2. Panel a shows gel-filtration 
analysis of the purified SARS 2-Helix protein. The inset picture is 
‘of the protein collected from the peak analyzed on Tristricine 
10s SDS-PAGE. Panel by shows gel-filtration analysis of the 
complex of HRI and HR2, The estimated molecular weight was 
determined in comparison with the standards (14 and 52 kDa), 


kDa, also indicating that the fusion core might form 
oligomers (Figure 5B). Subsequent chemical cross-linking 
‘demonstrated that the SARS 2-Helix protein oligomer was, 
in fact, a timer (Figure 6a), although the monomer/dimer 
‘bunds could also be found in high concentrations of the eross- 
linker. In addition, the timer content increased as the 
concentration of the chemical cross-linker was increased. 
Dimer and trimer bands could also be seen if the samples 
(reduced by SDS loading buffer) were not boiled at 100 °C. 
‘These results indicate that HRI interaets with HR2 to form 
‘a stable trimeric complex. 

‘The secondary structure of the 2-Helix protein was 
‘examined by CD spectroscopy as described in Experimental 
Procedures. The absorption curve showed that the 2-Helix 
protein had double minima at 208 and 222 nm (Figure 7a), 
consistent with a typical cehelix. structure. ‘The thermal 
stability test by CD spectroscopy showed that trimer forma- 
tion of the 2-Helix was very stable in PBS (Figure 7b), 
indicating thatthe 2-Helix timer represents the core structure 
fof the postfusion state of the coiled coil bundle. The 
ppostfusion state has been studied in great detail for other 

ial fusion proteins (6, 25—29). 

SARS-CoV has been convincingly identified as a member 
‘of family Coronaviridae (1). Our results here show that their 
heptad repeat regions, HRI and HR2, interact with each other 
to form a typical stable six-helix coiled coil bundle, which 
‘epresents the fusion core of the fusion protein (6, 26). The 
six-helix bundle is extremely stable to thermal denaturation, 
‘The fusion core studied here has a melting temperature of 
‘about 85 °C and could not be separated easily in SDS— 
PAGE unless boiled at 100 *C with SDS~PAGE leading 
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Ficuae 6 Chemical cross-linking of the SARS-CoV 2-Helix 
protein with different concentrations of chemical crosslinker, EGS, 
(ethylene glycol bis(succinimidyl succinate) from Sigma). In panel 
2 erase linked products were scparaled on Tris-triine 10% SDS— 
PAGE followed by Commassie biliant blue staining. Protcin 
markers (M) are shown in kDa, Lanes 1 and 36 indicate the 
different concentrations ofthe EGS used (0, 0.1.05, 15, and SM 
respectively). Lane 2 represents 2-Helix protein samples (reduced 
by SDS loading buffer) not boiled at 100°C. Bands corresponding 
to monomer, dimer, and trimer are indicated. Panel b presents 


schematic diagram of crost-linking. 
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Ficuae 7: The CD spectra and thermal stability measurement of 
the SARS-CoV 2-Helix protcins in PBS. In panel a the 2-Helix 
proteins show a typical cheix secondary structure at 25 °C. In 
panel b,dhermal stability was recorded at 222 nm. The Ty is over 
‘50°C as the plateau was not reached up to 90 °C. 


buffer, Recent studies showed that peptides derived from 
HRI region or HR2 region of SARS-CoV spike protein are 
potent inhibitors for viral entry (2/,30), which is consistent 
With the observations made in this study. ‘These properties 
suggest that the six-helix bundle has similar structure with 
other viral fusion cores such as HIV gps and influenza HA, 
which have been studied in great detail (8, 1/, 13), 

‘Model of SARS-CoV Fusion Core Structure and Potential 
Fusion Inhibitor Design. We have recently determined the 
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Ficune & A model of the fusion core structure of SARS-CoV spike protein: (a) sequence alignment of HRI and HR? region between 
‘MHV and SARS-CoV spike proteins; (b) model of fusion core stucture of SARS-CoV S protein. A surface map showing the hydrophobic 
[grooves on the surface of thee central HRI is presented on the right side. Three HR2 helices pack agains the hydrophobic groove in an 
Antiparallel maancr (left side). The helical regions in HR2 extended regions could be observed clearly. The figure onthe right side shows 


the deep and relatively shallow grooves on the surface of the central HRI coiled coil. The helical region of HR2 just packs against the deep 
‘groove and the extended region packs against shallow groves, The deep groove woald be an important target ste forthe fusion inhibitor 


design, 


crystal structure of fusion core of the MH S protein, which 
‘epresents the first fusion core structure of any coronavirus 
(3D). The structure reveals a central hydrophobic coiled coil 
‘timer surrounded by three helices in an oblique, antiparallel 
manner. This structure shares significant similarity to both 
the low-pH-induced conformation of influenza hemagglutinin 
and fusion core of HIV gpd, indicating that the structure 
‘represents a postfusion state formed after several conforma- 
tonal changes. Our results further indicate that mechanisms 
for the viral fusion of coronaviruses are similar to those of 
influenza virus and HIV. 

Here, we propose a SARS-CoV fusion core structure 
model (Figure 8b) us a six-helix antiparallel bundle based 
‘on the following evidence. Fist, the sequence alignment of 
HRI and HR2 regions in spike proteins between MHV and 
SARS-CoV shows significant similarity in these heptad 
repeat regions (for HRI, identity is 60% and similarity is 
910%; for HR2, identity is 35% and similarity is 85%) (Figure 
a), highly conserved HRI and HR2 regions in coronavirus 
spike proteins suggest a similar three-dimensional structure 
between MHV and SARS-CoV fusion core. Second, residues 
inthe a d,e, and g positions in HRI and the a and d positions 
in HR? regions, which are very important for the coiled coil 
formation, are highly conserved (Figure 8a), Third, our 
previous biochemical analysis inthis paper indicates that the 
2-Helix construct (HRI and HR2 with a short Linker) of 
SARS-CoV spike protein is also a trimeric a-helix and is 


thermostable, Last, virus cell entry inhibition and cell~celt 
fusion inhibition experiments show that the HR2 region of 
SARS-CoV spike protein bas a similar function to C peptides 
in HIV-1 in viral fusion inhibition (27,30). We can therefore 
conclude that the SARS-CoV spike protein fusion core has 
1 three-dimensional structure similar to that of MHV. 

In this model, the complex is six-helix bundle comprising 
aa tvimer of 2-Helix. The six-helix bundle consists ofa parallel, 
twimeric coiled coil of three HRI helices with three HR2 
helices packing against the grooves formed on the surface 
of the HRI core (Figure 8b). The N terminus of HRI and 
the C terminus of HR2 are located at the same end of the 
six-helix bundles, In this patter, the likely role of the six- 
helix bundle structure is to facilitate juxtaposition of the viral 
snd cellular membranes by bringing the fusion peptide, which 
inserts into the cellular membrane, close to the transmem- 
brane segment, which is anchored in the viral membrane. 
‘Analogous to the HIV-1 C peptides, the HR2 region of 
SARS-CoV spike protein is likely to function in a dominant- 
negative manner by binding to the transiently exposed 
hydrophobic grooves in the prehairpin intermediate and 
‘consequently blocking the formation of the fusion-active 
hairpin structures (5). A similar approach may be applied to 
identify inhibitors of SARS-CoV infection since there are 
currently no effective agents for the antiviral therapy of 
SARS infection, 
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‘Table 1 PotenialInibitors for Veal Eairy 


7 ounce residues Sequence 
Pr TR Tas118 VDL GDISGINASVVNIQKEIDRINEVARNENESLIDLQE 
P2 HR L@o-1177 NIQKEIDRLNEVAKNLNE 

P3 HR Heo- 118s NIQKEIDELNEVAKNENESLIDLQE 

Pi HIRO + spacer! 1160-1200 NIQKEIDRLNEVAKNLNESLIDLOELGKYEQYIAWPWIVWL 
Ps HRI om-927 FNKAISQIQESLTTTSTAL 

Po HRI ‘QKQIANQENKAISQIQESLTTTSTAL 

P GCNA + HRI GCN4—ENKAISQIQESLTTISTAL 

Pe GCNA + HRI GCN QKQLANQFNKAISQIQESLTTISTAL 

Shei! HRI+ HR2 Po-linker-P¥linker-Po-linker Plinker-P6 

D-pepider phage library COENEN NN KK NONE 


“HRI, QRQIANQFNKAISQIQESLTTHISTALGELQDVVNONAQALNTLVKQLSSNF (002—082), HR2, DVDLGDISGINASVVNIQREIDRI 


NEVAKNLNESLIDLQE (1145-1180). "Spacer is the 


ion between HR2 andthe wansmembrane domain. The sequence of GCN is 


RMKQIEDKIFEIESKQKKIENEIARIKK (9) Poio-P3 linker, GGSGG; P3-0-P6 linker, GSSGG. (9). “D-peptide will be selected by mire 


image phage display using P7 as target (7) 


In the model of fusion core of SARS-CoV spike protein, 
residues 902—952 form the HRI region and residues 1145— 
1184 form the HR2 region, which packs against the 
hydrophobic grooves on the surface of HRI. Residues 1161— 
1177 (QKEIDRLNEVAKNLNE) form a typical a-belix 
structure, which packs against a deep groove consisting of 
residues 909-927 (FNKAISQIQESLTTTSTAL) in the HRI 
region (Figure 8b). The other regions in HR? form extended 
strands, which pack against the relatively shallow grooves 
‘on the surface of central HRI coiled coil (Figure 8b). On 
the basis of our biochemical analysis and molecular modeling 
in this study, we can propose an important target site (deep 
groove) for the fusion inhibitor design. Several strategies 
‘which have been successfully used in fusion inhibitor design 
for HIV may prove useful for treating SARS infection (5, 6, 
9, 10, 17, 18, 21, 30) 

HRI, HR2, and their derivatives could be potential 
inhibitors for vieal entry. Table 1 shows several potential 
inhibitors that we proposed on the basis of our biochemical 
results and molecular modeling. P1—P4 are designed from 
the HR2 region and PS~P8 from the HRI region. PI is the 
full-length HR2, which we propose will have high inhibitory 
‘activity but will be sensitive to proteolytic degradation. P2 
is a typical achelix region in HR2, which packs against the 
deep groove on the surface of central HRI coiled coil. The 
inhibitory activity of peptides derived from the HR2 region 
thas been reported in recent published papers (21,30), which 
is consistent with the structure model. P3 and P4 contain P2 
‘and are extended to include aelditional residues, P3 extends 
to the end of the HR2 region and P4 extends to residues 
near the transmembrane region, similar to T20 in C-peptide 
of HIV gp41 (18). P5 consists of residues forming a deep 
‘groove in the HRI region, which we think is a strong 
potential binding site for small molecule inhibitors. P6 
‘consists of residues from the N terminus of HRI to the C 
terminus of PS, the end of the deep groove. Both PS and PO 
‘are expected to have low inhibitory activity because they 
‘are highly hydrophobie and will tend to aggregate in solution, 
7 and P8 are derived from PS and P6, respectively, with 
the addition of GCN4 at the N terminus to increase their 
solubility and inhibitory activity. 5-Helix and D-peptide are 
also potential inhibitors for viral fusion, which have been 
successfully identified in studies of HIV gp41 (9, 17) 
Biological experiments to verify the inhibitory activity of 
these peptides and potent inhibitor design are underway. 


Comparison to Recent Results for the Binding Regions 
HRI and HR2 in SARS Fusion Core. Recent studies by Tripet 
and co-workers have shown that the site of interaction 
between the HRI and HR2 regions is between residues 916~ 
950 of HRI and residues 1151—1185 of HR2 (9). However, 
four results show that the binding regions of HR and HR2 
are residues 902—952 in HRI and 11451184 in HR2. The 
‘most significant difference between the two conclusions is 
that the fusion core that we defined includes residues 90: 
916 in HRI, which are particularly important for binding to 
HR2. These 14 residues form half of the deep groove for 
the binding of helical regions in HR2, Another difference is 
that the HR2 region that we defined includes residues 114 
1151, which also pack against the grooves formed by HRI 
by means of hydrophobic interactions, although these seven 
residues form an extended strand rather than an achelix. In 
our results, the HRI chain is slightly longer than the HR2 
chain but almost the same in length in the three-dimensional 
structure, Analogous to extended region of HR2 in SVS F 
(22), the extended regions of HR2 in SARS fusion core pack 
against the central HRI coiled coil, making HRI (40 
residues) have similar length as HR2 (51 residues). 
Conclusions. Biochemical and structural analysis of the 
sixchelix bundle provides a detailed picture ofthe fusion core 
structure of SARS spike protein and the underlying molecular 
‘mechanism for viral entry and fusion inhibition. This will 
also open a new avenue toward the structure-based fusion 
inhibitor design of peptides. or peptide analogues, for 
‘example, small molecules, targeting the newly emerging 
infectious disease, severe acute respiratory syndrome (SARS), 
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